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ABSTRACT UDP-GlcN Ac: a-3-D-mannoslde 0-1,2 <N« 
acHytgtuo»amlnyttnukrfej^ I (GnT I; EC 2.4.1.101) cata- 
lyzes an essential first step In the conversion of high-mannose 
tf-gtycans to hybrid awl complex AT-gfycaos. Cloning of the 
gene enrodfrtg this enzyme was carried oat by mixed ofigoav- 
cleotide-primed polymerase chain reaction ainpifkali un of 
rabbit Ihcr single-stranded cDNA nsing sense aiidairtlaenae 20- 
to 24-base-pair (bp) primers. A rabbit Hrer Horary hi phage 
AgtlO yielded a 2.5-Uohase (kb) cDNA with a 447-arnino add 
coding sequence. None of the nine asparagfne residues were In 
an Asn-Xaa-<Ser or Thr) sequence, indicating that the protein 
Is not N-g frc osy tat ed. There is no seqnence hoenolog y to other 
previously cloned glycoayUransferasea, bat GnT I appears to 
have a domain stnictsre typkal of these enzymes — l.e., a short 
amino-ternnnal domain, a transmembrane domain, a "neck" 
region, and a large cartoxyt-tennmal catalytic domain. UNA 
was tranacrlbed off the 2.5-fcb cONA, and im vitro translation 
with rabbit reticulocyte tysate yiesded a 52-kDa protein wtth 
GnT I activity. 



The biosynthesis of highly branched N- and 0-glycans is 
potentially important to many biological phenomena (1-3). 
All /V-glycans share the common core structure Mano6- 
(Mano3)Man^4GlcNAc^4GlcNAc^Asn. Complex /V-gly- 
cans have branches that are initiated by the action of Golgf- 
localized GlcN Ac -transferases designated GnT I to VI (4). 
The conversion of high-mannose /V-glycans to complex and 
hybrid JV-glycans is controlled by UDP-GlcNAc:a-3-D- 
mannostde 0-1 ,2-N-^w^tylghicc«aminyltraiwferasc I (GnT I; 
EC 2.4.1.101), which catalyzes the reaction: 

UDP-GlcNAc + (Mano6[Mana31Manaf5XMa2ia3)Man^4R 
(Mano6(Mana3}Maria6XG/c/VAr/32Mana3)Man^4R + UDP, 

where R is GlcN Ac04( + / - FucooXjIcN Ac- Asn-Xaa. 

GnT I is essential for subsequent action of several enzymes 
in the processing pathway (5-8)— i.e., GnT II, III, and IV 
require the prior action of GnT I, and GnT V and VI require 
the prior action of GnT II. GnT I has been reported in hen 
oviduct, Chinese hamster ovary cells, baby hamster kidney 
cells, bovine colostrum, pig trachea, and mammalian b'ver (5, 
6, 9, 10). The enzyme has been partially purified from bovine 
colostrum (11), from pig liver and trachea (12), and, to 
homogeneity, from rabbit liver (13, 14). 

We have used mixed oligonucleo tide-primed polymerase 
chain reaction (PCR) amplification (15, 16) to clone a 2.5- 
kilobase (kb) cDN A coding for rabbtt liver GnT I. The protein 
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contains 447 amino acids and has a domain structure typical 
of gly cosy (transferases — i.e., a short amino-terminal do- 
main, a transmembrane domain, a "neck** region, and a large 
carboxyl terminal catalytic domain. Thus, the gene encoding 
a medial GolgMocahzed glycosyltransferase has been cloned 
and the sequence determined J 1 

MATERIALS AND METHODS 

Preparation of Peptides. Glycerol, Triton X-100, and salts 
were removed from 15 /ig of purified enzyme (14) by "in- 
verse-gradient" reversedrphase HPLC (RP-HPLQ (17). The 
enzyme solution (100 /d) was diluted to 1.2 ml with 1 -pro- 
pane! and loaded on a VeloSep C 8 cartridge (3-pm particle 
size, 30 x 2.1 mm i.d.; Applied Biosy stems) equilibrated in 
100% 1-propanol at 40°C. GnT I was ehited at 0.1 ml/min by 
a linear gradient (5%/min) of decreasing 1-propanol concen- 
tration (100-50%) generated with 100% 1-propanol and 50% 
l-propanol/50% water containing 0.4% (vol/vol) trifiuoro- 
aceuc acid at 40°C. The GnT I-containing fraction was 
adjusted to 0.02% (wt/vol) with respect to Tween 20 (Pierce), 
concentrated to 100 id under vacuum, and diluted to 1.5 ml 
with 5% (vol/vol) formic acid containing 0.02% Tween 20. 

Edman degradation of purified GnT I (-200 pmol) yielded 
no ammo-terminal sequence. GnT I was digested with pepsin 
(Sigma) at an enzyme/substrate mass ratio of 1:20 for 1 hr at 
37°C, and the digest was fractionated by RP-HPLC to yield 
peptides 5 and 6 (Fig. 1). Core GnT I remaining after pepsin 
digestion was reduced with dithiothreitoi and alkylated with 
iodoacctk acid (18) to give core S-carboxy methylated(SCM> 
GnT I, which was purified by RP-HPLC (18, 19). Pepsin- 
treated core SCM-GnT I (10 /ig in 1 ml of 1% ammonium 
bicarbonate/1 mM CaCl2/0.02% Tween 20) was digested 
with trypsin (Worthington) at an enzyme/substrate mass 
ratio of 1:20 for 16 hr at 37°C. Trypsin resulted in tittle further 
digestion of the pepsin-treated material. Sequence analysis of 
a portion of this material resulted in 33 amino acid assign- 
ments (peptide 1 in Fig. 1). Pepsin and trypsin- treated core 
SCM-GnT I (8 jig in 1 ml of 1% ammonium bicarbonate/ 
0.02% Tween 20) was digested with thermolysin (Sigma) at an 
enzyme/substrate mass ratio of 1:20 for 2 hr at 50°C, and the 
digest was fractionated by RP-HPLC to yield peptides 2, 3, 
4, 7, and 8 (Fig. 1). 



Abbreviations: GoT I, UDP-GlcNAc:a3-r>mannoside 0-1, 
acctytgjucosammyltiarisferase I; PCR, polymerase chain reaction; 
RP-HPLC, reversed-phase HPLC; R. GlcNAc04(±Fuca6)GlcNAc- 
Asn-Xaa. 

[1 Thc sequence reported in this paper has been deposited in the 
GenBank data base (accession no. M57301). 
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QligQattclaotldai : 

28 : 5 ' - TGGGCl QAACTIGAACC I AAATGG- 3 * 
GT G O 

2A:3 • - CCATTT IGGITCI AGTTCIGCCCA- 3 » 
C C AC 

33:5' -TTTTGGGATGATTGGATGCO-3 ' 
C C C A 

JA : 5 * -CGCATCCAATCATCCCAAAA- 3 ' 
9 GO G 

€8:5' ~CAAACXTGGGA*GGXYATGATCC-3 ' 
0 C C C 

6A : S * -GGATCATAICCATCCCArGTTTG-3 ' 
G G G C 

Fig. 1. Amino acid sequence data for the eight peptides isolated 
from rabbit liver GnT I and nucleotide sequences of the six synthetic 
oligonucleotides prepared on the basis of the peptide sequences. 
Single letter code is used for amino acid sequence data; upper case 
letters indicate firm assignments, and lower case letters indicate 
tentative assignments. Underlined sections of the peptide sequences 
indicate regions used for the design of oligonucleotide probes. Probes 
2, 3, and 6 were based on peptides 2,3, and 6, respectively; S indicates 
"sense" and A indicates "antisense" directions. 

HPL€. RP-HPLC was carried out on a Hewlett-Packard 
liquid chromatograph (model 1090A) fitted with a diode array 
detector (model 1040A) (18). A Brownlee RP 300 column 
(30-nm-pore-size, 7-/ira-diameter dime thy loctyl silica parti- 
cles packed into a stainless steel cartridge, 30 x 2.1 mm i.d.; 
Brownlee Laboratories) was Used for all peptide separations. 

Ammo Add Sequence Analysis. Automated amino acid 
sequence analysis of GnT I and derived peptides was per- 
formed with Applied Biosystems sequencers (models 470A 
and 477A) equipped with on-line phenylthiohydantoin- 
conjugated amino acid analyzers (model 120 A). Polybrene 
(20) was used as a carrier. 

Oligonucleotides and cDNA Synthesis. Oligonucleotides 
were synthesized on a Pharmacia automated oligonucleotide 
synthesizer at the Hospital for Sick Children-Pharmacia 
Biotechnology Service Centre. Total RNA was prepared 
from rabbit liver by the method of Chirgwin et ai (21, 22). 
Poly(A)* RNA was prepared by oligo(dT) chromatography 
(23). Single-stranded cDNA synthesis was performed by 
using the RiboClone cDNA synthesis system (Proroega), 
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total rabbit liver RNA (20 jig) as template, and a mixture of 
the three antisense oligonucleotide primers 2A, 3A, and 6A 
(Fig. 1) at concentrations of 50 nM each as primers. 

Amplication of cDNA. PCR was carried out in a total 
volume of 0. 1 ml containing 50 mM KC1, 10 mM TrisHCl (pH 
8.3), 1.5 mM MgCl 2 , 0.01% gelatin, each of the four dNTPs 
at 0.2 mM, 0.5 /jM of each oligonucleotide in six paired 
combinations of oligonucleotide primers (2S, 3 A; 2S, 6A; 3S, 
2A; 3S, 6A; 6S, 2A; and 6S, 3A; Fig. 1), 10 $d of RNA-free 
rabbit liver cDNA (see above), 2.5 units of Thermus aquat- 
icus (Tag) polymerase (Perkin-Elmer/Cetus), and 0.1 ml of 
mineral oil. The samples were subjected to a temperature- 
step cycle of 94°C (0.5 rain), 50°C (1 min), and 72°C (2 min) 
for a total of 40 cycles, followed by a 10- min extension at 72°C 
after the final cycle. DNA from the PCR reactions showed 
two products (0.45 and 0.50 kb), which were blunt-end* 
ligated into the Sma I site of pGEM-7Z (Promega) (16). The 
recombinant plasmids were amplified in Escherichia coli 
XL 1 -blue cells and purified. 

Screening of Rabbit Liver cDNA Library in AgtlO. The 
pGEM-7Z/0.5-kb PCR product plasmid was cut with BamHl 
and used to generate a riboprobe with the Promega Ribo- 
probe Gemini II core system. Twenty-five microliters of 
reaction mixture contained 32 mM Tris-HCl (pH 7.5), 5 mM 
MgCl 2 t 2 mM spermidine, 8 mM sodium chloride, 8 mM 
dithiothreitol, 40 units of RNasin, 0.4 mM each of ATP, GTP, 
and UTP, 5 /il of (o> 32 PJCTP (800 Ci/tnmol; 1 Ci = 37 GBq), 
1 $m% of AomHI-cut pGEM-7Z/PCR product plasmid, and 2 
units of T7 RNA polymerase. Incubation was at 40°C for 2 nr. 
RNasc-free DNasc I (10 units) was added, followed by 
incubation at room temperature for 15 min. Buffer (80 of 
50 mM TrisHCl, pH 7.4/4 mM EDTA/300 mM NaCl/0.1% 
SDS) and tRNA (20 fig) were added, followed by extraction 
with phenol/chloroform/isoamyl alcohol, 25:24:1 (vol/vol). 
The labeled RNA probe was desalted over a Sephadex G-50 
column (NICK column, Pharmacia) and used to screen 10* 
plaques of a rabbit liver cDNA library in AgtlO (5'-stretch, no. 
TL1006a from Clontech, EcoRl cloning site) (24). The largest 
cDNA insert obtained on £coRI digestion was 1.6 kb; it was 
subcloned into pGEM-7Z (Promega) (24), and plasmid DNA 
was purified (22). The cDNA library was re screened with a 
80-bp riboprobe from the 5' end of the 1.6-kb clone. The 
largest cDNA insert (3.0 kb) was subcloned into pGEM-7Z. 

DNA Sequencing. Two colonies of the pGEM-7Z/PCR- 
product recombinant plasmid (see above) containing inserts 
in opposite directions were sequenced directly by the single- 
strand mdeoxynucleodde-cham-terniination method (25) us- 
ing deoxyadenosine 5'-[o>( 33 S)thio]triphosphate, Sequenase 
(United States Biochemical) and the forward primer for 
pGEM-7Z. The 1.6- and 3.0-kb clones were sequenced by the 
Erase-a-Base System (Promega) and the single-strand 
mdeoxynucleoUde-cham-termination method. Both DNA 
strands were sequenced by using colonies in which the inserts 
were present in opposite directions. 

RNA Hybridization. Rabbit liver poIv(A) + RNA (5 pg) was 
denatured in 50% (vol/vol) formamide/6% (vol/vol) formal- 
dehyde buffer at 65°C and was resolved by gel electropho- 
resis in a 1% agarose gel containing 6% formaldehyde. The 
RNA was transferred to a nitrocellulose filter, and the filters 
were hybridized with the "P-labeled 0.5-kb PCR riboprobe 
(see above) followed by autoradiography. The specific ac- 
tivity of the probe was about 10* dpm/ng, and the hybrid- 
ization solution contained about 10* dpm/ml. 

in Vitro Transcription and Translation. The recombinant 
plasmid containing pGEM-7Z (Promega) and the 2.5-kb GnT 
I cDNA insert (rc2500 in Fig. 2) was cut with Sph I to generate 
linear plasmid. RNA was transcribed by using the phage SP6 
RNA polymerase promoter and initiation site present in 
pGEM-7Z. RNA synthesis was carried out at 40°C for 1 hr in 
a total volume of 50 /d containing 40 mM Tris-HCl (pH 7.5); 
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Fio. 2. Schematic representation of GnT 1 clones. "PCR prod- 
uct 1 * is the product obtained by PCR amplification of rabbet liver 
cDN A; "rclMO" is the 1 .6 kb-CnT 1 cDNA clone; and "rc2500" is the 
3.0-kb GnT 1 cDNA clone. The shaded boxes represent the coding 
region. During subcloning, the 3.0-kb cDNA was reduced to 2.5 kb 
by a 0.5-kb deletion at the 5' end. 

6 mM MgCI 2 ; 2 mM spermidine; 10 mM NaCl; 10 tnM 
dithiothreitol ; 40 units of RNasin (Promega); 0.5 mM each of 
ATP, UTP, and CTP; 0.1 mM GTP; 0.5 mM m'Gtf'frpptf'Xj 
(Pharmacia); 10 units of SP6 RNA polymerase; and 10 p% of 
linearized plasmid. Control incubations were carried out in 
the absence of plasmid or with a linearized pGEM-7Z re- 
combinant plasmid containing a noncoding insert. The reac- 
tion mixture was extracted twice with phenol/chloroform/ 
isoamyl alcohol, followed by precipitation with cold ethanol. 

Protein synthesis (translation) was carried out at 30°C for 
1 hr in a total volume of 50 containing all 20 amino acids 
(1 mM each), 20 units of RNasin, RNA as prepared above, 
and buffer and rabbit reticulocyte lysate as supplied by 
Promega (26). Nonradioactive amino acids were used .when 
the products of translation were assayed for GnT I activity 
(see below). Separate incubations were carried out with 
L-[ 35 S]methiooine (1000 Ci/mmot; 90 *tCi per incubation) 
replacing nonradioactive methionine; these incubations were 
analyzed by SDS/po4yacrylamide gel electrophoresis fol- 
lowed by autoradiography. 

GnT I was assayed with 0.6 mM Man a6( Man a3) Mart 0- 
hexyl (a gift from Hans Paulsen, University of Hamburg), and 
the product was isolated either with Sep-Pak Ci« reverse- 
phase cartridges (Waters) (27) or by HPLC (4, 10). 

RESULTS 

Amplification of cDNA. Three amino acid sequences (Fig. 
1) were chosen for the design of sense and anti sense oligo- 
nucleotide primers. Deoxyinosine was substituted in posi- 
tions where codon degeneracy was >2 (16). PCR was carried 
out with all six possible combinations of sense and antisense 
primers. Primer-dependent products were obtained with two 
of the six incubations— i.e., 2S, 6A (500 bp) and 3S, 6A (450 
bp) (Fig. 3). The complete nucleotide sequence for GnT I is 
shown in Fig. 4. 

Sequence Analysis* The 1.6-kb clone contains 0.5 kb from 
the V end of the coding region and the full 1.1 -kb 3' 
untranslated region (re 1600 in Fig. 2). The 3.0-kb clone 
yielded a 2485-bp sequence (rc2500 in Fig. 2; Fig. 4). We have 
shown (M.S. and H.S., unpublished data) that subcloning of 
the 3.0-kb DN A fragment in pGEM-7Z results in deletion of 
a 0.5-kb DNA fragment near the 5' end of the clone. Com- 
parison of the cDNA sequence shown in Fig. 4 with the 
sequence of human genomic DNA for GnT I (unpublished 
data) has shown that this deleted 0.5-kb DNA fragment is not 
part of the GnT I gene; we do not know the origin of this 
DNA. 

The GnT I coding sequence has 1341 bp and codes for a 
membrane-bound protein of 447 amino acids (M r 52,000). 
There is a single hydrophobic domain (bases 62-136) flanked 
by charged amino acids (Fig. 4). Chou-Fasman rules (28) 
predict that this hydrophobic segment is capable of propa- 
gating an ar-helix, as expected for a transmembrane domain. 
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Fig. 3. Agarose gel electrophoresis (1% agarose) of the products 
of the PCR with rabbit liver cDNA as template and the following 
combinations of oligonucleotides as primers: 2S, 3 A; 2S, 6A; 3S, 2A; 
3S, 6A; 6S, 2A; 6S, 3A (Fig. 1). Conditions of PCR were as described. 
The gel was stained with ethidium bromide (0.5 nt/ml). Primer- 
dependent products were obtained with combinations 2S, 6A (0.50 
kb) and 3S. 6A (0.45 kb). The arrow designates the 0.5 kb DNA 
marker; the remaining standards are at 1.0-kb, 1.6-kb, 2.0-kb, and 
1.0-kb intervals thereafter. 

The presumptive initiation methionine codon is at the ATG 
codon at position 50, which has an adenosine at position 47, 
thereby fulfilling the requirements for an initiation codon 
(29). All eight peptides shown in Fig. 1 (a total of 103 amino 
acid residues) can be identified in the sequence (Fig. 4); an 
additional five tentative assignments also match the se- 
quence. GnT I purified from rabbit liver has a molecular mass 
of about 45 kDa (14). The protein has no Ar-glycans since none 
of the nine asparagine residues are in a typical Asn-Xaa-(Ser 
or Thr) sequence; we have previously shown that rabbit liver 
GnT I binds poorly to lectin/agarose columns (14). If there 
are no or few 0-glycans, a catalyttcaUy active protein of 45 
kDa can be derived by cleavage at about base position 215 
(Fig. 4). 

Comparison of the GnT I sequence with those of several 
previously cloned glycosyltransferases (30-45) revealed no 
sequence homology, but GnT I appears to have a domain 
structure typical of these enzymes (46). Searches of the 
OenBank nucleotide data base (release 62.0) with the coding 
region of GnT I and of the Protein Identification Resource, 
National Biomedical Research Foundation (release 23.0) 
with the GnT I amino acid sequence revealed no significant 
similarities to other sequences. 

The complete sequence has a long 3' untranslated region 
(bases 1391-2479) containing the consensus polyadenylation 
signal AATAAA at position 2435 (47). Long 3' untranslated 
regions are typical of the known glycosyltransferase genes 
and may be a feature present in other Golgi-localized en- 
zymes (16). 

Northern Blot Analysis. The PCR ribo probe was used to 
determine the size of mRN A in rabbit liver. A major band was 
detected at about 3.0 kb with some smearing at lower 
molecular weights (data not shown), indicating that the 2.5-kb 
cDNA clone (Fig. 4) may not be full-length. 

In Vitro Transcription and Translation. Transcription of the 
linearized pG£M-7Z/2.5-kb GnT I cDNA recombinant plas- 
mid followed by translation in the presence of L-[ 33 S]methto- 
nine resulted in the appearance of a strong radioactive 52-kDa 
band on SDS/polyacrylamide gel electrophoresis; this band 
was not seen in control incubations lacking plasmid or 
containing control plasmid (Fig. 5). The molecular weight 
matches the prediction for the open reading frame shown in 
Fig. 4. Table 1 shows the results of GnT I assays carried out 
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L£C ?*E VAL ALA ASM AL A LEO LXU LE V IE? PKS ?HE TPP TKP 



:4C: rgt :ca ytg ret age igg ctg ccg tea gac aat get ett gar gat 
A*G PRO VAL rfO SCR ARC LEU PRC SEA \St ASH ALA LCC ASP ASP 



gec -jag gt a gag ttg gaa egt cag egg gga ctg ttg e*g cag att 
ALA ZLJ VAL SIC LtJ 3LU AUG C.LK ARC ZLt LEU LXU GLN GIN IL£ 

215: agg gag eae cite get etc tqg aqc cag egg tgg aag gig cct act 

APS 5LU HIS HIS ALA lE'J 7RF SEP GLN ARG TRP L*S VAX. FRO TKB 

i^C t oca gec cct ret get cag ccg cat gtg cct gtg sec eca ccg rca 
ALA ALA PJLO PRO ALA GLN rPO EIS VAL PRC VAL TKR PRC PRC PRO 

365: get gtg ate ccc at; ctg gta ate gee tgt gac cge age acc gtc 
ALA VAL ILC FRO :U LEU VAL :LX ALA CYS ASP APG SCR THR VAX 

4iC. cgc cgc t^t ttg g«c «ag eta etg cat tat egg cct tea get gag 
ARC ARC CYS LCC ASP LYS LEU LEU HIS TYB ABC PRC SEP ALA JLU 

4S&: Ttg ttc ccc ate an. gtc age caq gac tqt gqq cat qag gag aea 

leu phe pro :ls :ie VAL SEP cln ASP CtS CLY HIS SLU 3LC THR 

SCO: gee cag gtc att qct tee tat ggc age gea gtc aca eae ate egg 
ALA OUt VAL ILC ALA SEP TYP CLY SCR ALA VAL THP. HIS ILC ARC 

54 S: caa cct gac etg aqc aac att get gtg cag ccc gac cac cgc aag 
mU» PRO ASP LEU SER ASM ILL ALA VAL GLN PRO ASP HIS AAG „YS 

593: :s: cag ggc tac tac aag arc gca egg cat tae cgc tgg gea tzq 
P4£ CLN CLY Tr? 7YA LYS ILC ALA ARC HIS TYP ARC TRP ALA LEU 

$3-i: ggc caa ate ttc cac aat ttc aac tac cca gca get gtg gtg gtg 

sly clw :lc pme his ash par asm typ pro a la ala val val val 

*30: gaa qat gat etc qag gtg gca cca gae ttc ttt gag tae ttc cag 

GLU ASP ASP LEIT GLU VAL ALA PRO ASP PHE PHE GUJ TYR f HE GLH 

7?5: gec act tae cca ctg ttg laa gca gae ccc tec etc tgg tgt gig 
ALA THR 7YP ESQ 1EU L£C LYS ALA ASP PPO SER LEU IP* CYS VAL 

•TO: tct gec tgg aat gac aat ggc aaa gaa =ag atg gta gac teg agt 

SEP. ALA TBP A5N" ASP ASN GLY LYS GLU GLM MET VAL ASP SEP SEP 

915: aag cca qmq tta etc tac cqc aca gat ttc ttt cct qqe tta qgc 
LYS PRO CLt IEC L£f TYP ARC THP ASP PHZ PHE PPC GLY LEO GLY 

560; tgg tta ctg f.q qct -jaa etc tqg get qaa ctq qaq ccc aaq tqq 
TRP LEU LED LEU ALA GLU LEU TRPALA GLC LEO GLO PRO LYS7PP 

305: ccc aaa gee tic tgg gae gae tgg atg ege egg set gag =mq sqm 
PRO LYS ALA PHE TRP ASP ASP TWP WET APS ARS PRO GLU CLW ARC 

950: aaq g<;g ogg gec tqt qtg cgt cca goa ate tea oqa aca atg aca 
LYS GLY ARC ALA CYS VAL ARC PRO GLU ILE SER ARC THR MET THR 



1C4C: etc aaq ttc ate aaq etg aac cag cag :tt qta :ce ttc aec cag 
LEU LYS PRX TLE LYS IXU ASS 3Ui 3LN P8E VAL PRO PHE THR GLU 

109!: ctg gac ctg teg tac ctt eag cag gag gee tat gac tgg gat ttc 
'*EU ASP LEG SER TYP 1EO CLE CUJ GLU ALA TYP ASP ARC ASP PHE 

::j0: et: get cgt gtt tat ggt get ccc cag tta cag gtg gag aaa gtg 
-EV ALA AR3 VAL TYP GLY ALA PRO GUI LEC GLN VAL OLU LYS VAL 



1220: aea ggc jgg gae age ttc aag get ttc <?cc aag gec ctq ggt grc 
TttR GLY APS A<? £EP ?hE LYS ALA ?H£ ALA LYS ALA LEU GLY VAL 

12*5: atg gat gac etc aaa ggt jt* tec aqg get gqa tac egg ggc 

MET A£fr AS? LEU LfS SER GLY VAL PPC ARC ALA GLY TYR ARC CLY 

att gtc acc rt: tta ttc egg ggc rgc cgt gtc cac ctg gcg cee 

:le val thr ?:-:r -jc: ph£ apg gly arc ap'S val his ltj ala pro 
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Fig. 4. Nucleotide sequence (lowercase letters) of the 2.5-kb 
GnT I cDNA clone. The amino acid sequence in the coding region is 
shown in uppercase letters. The positions of the eight peptide 
sequences obtained from proteolytic digests of GnT I (Fig. 1) are 
underlined with a single solid line; the regions of these peptide 
sequences used for oligonucleotide probe synthesis (Fig. 1) are 
additionally underlined with a discontinuous line. The putative 
transmembrane segment (bases 62-136) is underlined with a double 
line. The consensus polyadenylylation signal AATAAA at position 
2435 is underlined. Only the nucleotide sequence is numbered. 




Fig. 5. An autoradiogram of an SDS/polyacrylamidc geJ elec- 
trophoresis experiment showing in vitro transcription and transla- 
tion. mRNA was generated from the 2.5-kb GnT I cDNA and was 
used as the template for in vitro translation using rabbit reticulocyte 
lysate and L-[ 33 S] methionine. Lanes: C, no plasmid in the incubation; 
12, pGEM-7Z containing the 2.5-kb GnT I cDNA with an insert 
between bases 56 and 57 that interrupts the reading frame; 16, 
pGEM-7Z containing the 2.5-kb GnT I cDNA. 

on the transcription-translation incubations. The incubation 
containing the pGEM-7Z/2.5-kb GnT I cDNA recombinant 
plasmid has appreciable GnT I activity, whereas both con- 
trols show low activity. It is concluded that the 2.5-kb 
sequence shown in Fig. 4 can code for the synthesis of 
catalytically active GnT I. 

DISCUSSION 

GnT I catalyzes an essential first step in the conversion of 
high-man nose jV-gJycans to branched hybrid and complex 
Af-glycans (7 f 10). In vitro transcription/translation of the 
2.5-kb cDNA reported in this paper results in GnT I activity, 
indicating that we have cloned the gene for the catalytic 
domain of this important control enzyme. 

At least seven glycosyltransf erases involved in the syn- 
thesis of N- and 0-glycans have been cloned to date (30-45, 
48). These transferases all place sugars in terminal or sub- 
terminal positions; three of them (£-l,4-galactosyl-, a-2,6- 
sialyl-, and a-l,3-GalNAc-transfera$es) have been localized 
to the trans-Golgi cisternae and trans-GoIgi network, at least 
in some tissues. Most of these transferases share no signif- 
icant sequence similarities but have very similar domain 
structures — i.e., a short amino-terminal cytoplasmic tail, a 
16- to 20-amino acid transmembrane segment (noncleavable 
signal-anchor domain), a "stem" or "neck" region of unde- 
termined length, and a long carboxyl-terminal catalytic do- 
main, which is in the Golgi lumen (46). 

The presence of a "neck" region is based on the finding that 
the or-2,6-sialyltransferase (45, 49) and the 0-1,4-gaJactosyl- 
transferase (31) can be cut by proteases to release a smaller 
catalytically active protein lacking the transmembrane do- 
main. The exact length of this "neck" region cannot be stated 
with accuracy because it is not known how much of the 
amino-tenninal sequence can be removed without loss of 
catalytic activity. We have shown that rabbit liver GnT I (14) 
and rat liver GnT II (50, 51) exist in two forms: (i) a large 
amount of presumably membrane-bound material that does 
not adhere to columns and has proven impossible to purify in 
our hands and (ii) a small amount of material that can be 
purified. In the case of GnT I, it is now clear from the 
sequence analysis that the 45-kDa form of the catalytically 
active protein that we have purified has been derived from the 
membrane-bound precursor by proteolytic cleavage at about 
base position 215 in the "neck" region (Fig. 4). Therefore, the 
amino-terminal blockage of this 45-kDa protein must be due 
to chemical modification during GnT I purification. 

Rabbit GnT I; human, mouse, and bovine UDPgalactose: 
GlcNAc-R 0-1,4-galactosyl transferases (EC 2.4.1.38); and 
human LJDP-GaiNAc:Fuca2Gal-R (Gal N Ac to Gal) o>l,3- 
GaiNAc-transferase (EC 2.4.1,40) have an abnormally high 
number of proline residues between the transmembrane 
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Table 1. In vitro transcripckMMnuislalkm of rabbit GnT I cDNA 

OnT I product, nmol per total 
transcription incubation 

Sep^Pak assays HPLC 16-hr 

Conditions of transcription 2 hr 16 hr assays 

No pJasmid — — 021 — 
Control plasmid 0.04 0.21 0.29 
2.3-kb GnT 1 cDNA 0.41 L05 1.32 

RN A was transcribed from rabbit GnT I cDNA and translated in 
vitro as described in the text. OnT I assays were carried out on these 
incubations, and GnT I product was purified either by adsorption and 
ehttion from Sep-Pak C u cartridges or by HPLC. Each GnT I assay 
contained in a total volume of 0.040 ml: 20 nmol of LTDP-iV-{l- l4 0 
acetyl-iv glucosamine (96,000 dpm) and 24 nmol of exogenous ac- 
ceptor Mano6(Man4k3)Man0-hexyi. Incorporation was corrected for 
control GnT I assays lacking exogenous acceptor (0.006 nmol). The 
activity of GnT I present in the controls was due to the presence of 
GnT I in the rabbit reticulocyte lysate. 

domain and the catalytic domain. This protine-rich "neck' 9 
may play a role in positioning the catalytic domain in the 
lumen of the Golgj to enable glycosylation of glycoproteins 
moving along the Golgi lumen. 

The domain structure of GnT I appears to be similar to that 
of the previously cloned glycosy ltransferascs . However, 
GnT I differs from these transferases in being a medial-Golgi 
enzyme, at least in some tissues (52, 53). Although no 
medial-Golgi glycosyltransferase has been cloned to date to 
our knowledge, rat liver a>mannosidase II (also a medial- 
Golgi enzyme) has been partially cloned (16). Comparison 
with GnT I reveals a 16-amino acid sequence in GnT I 
(Leu-His-Tyr-Arg-Pro-Ser-Ala-Glu-Leu-Phe-Pro-Ile-Ile- 
Val-Ser-Gln, bases 431-478, Fig. 4) that shows a high simi- 
larity score to amino acid residues 403-418 in o-mannosidase 
II (Leu-Gln-Tyr-Arg-Asn-Tyr-Glu-Gln-Lcu-Phc-Ser-Tyr- 
Met-Asn-Ser-Gln). 

Note Added in Proof. Prctiminary reports on the cloning and structure 
of the genes for human and rabbit GnT I have been published (54-56). 
We have sequenced a 4-kb section of human genomic DNA con- 
taining a functional promoter and an intronless coding region for a 
445-amino acid protein with GnT I activity. The similarity between 
the rabbit and human enzymes is 85% for the nucleotide coding 
sequences and over 90% for the amino acid sequences. 

We thank Dr. Stanley Zheng for his assistance with many aspects 
of this work, in particular the transcription-translation experiments. 
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